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SUMMARY 

2  2 

Let  X  ^  N  (u,  1),  p  2s  3,  where  £  *  diag(o, ,  ....  o  )  and  let 

~  P  **  *  P 

2  2 

s./o.  ^  X  (i  ■  1,  ...»  p)s  independent  of  X.  This  paper  obtains 

•  i  ^ 

a  class  of  mininax  estimators  <5  for  £  subject  to  the  loss  func¬ 
tion  (6  -  u)'Q(£  -  ji)/tr(QE)  where  Q  is  a  p  *  p  diagonal  matrix 
with  unknown  positive  diagonal  elements.  It  is  assumed  that  an 

A*  1  .1 

independent  estimator  Q  of  Q  is  available  which  satisfies  cer¬ 
tain  conditions.  The  new  minimax  estimator  is  a  function  of 
*-l 

X,  s^,  and  Q  ,  and  takes  a  form  similar  to  the  minimax  estimator 

obtained  by  Berger  and  Bock  (Ann.  Statist.  £  (1976),  642-648)  for 

the  case  when  Q  is  known.  A  class  of  minimax  estimators  for  g  is 

2  2 

also  obtained  for  the  special  case  when  Oj  ■  ...  »  op. 

*This  work  was  supported  by  the  Army,  Navy  and  Air  Force  under 
Office  of  Naval  Research  Contract  No.  N00014-80-C-0093.  Repro¬ 
duction  in  whole  or  in  part  is  permitted  for  any  purpose  of  the 
United  States  Government. 

AMS  1970  subject  classifications:  Primary  62C07;  Secondary  62F10, 
62H99 

Key  words  and  phrases.  Minimax,  unknown  variances,  quadratic  loss 
with  unknown  weights,  risk  function. 


1.  INTRODUCTION 

Let  X  •  (X  ,  ,  X  )'  be  a  p-vLriate  normal  random  vector 

1  p  \\ 

with  mean  ji  and  covariance  matrix  £.  This  paper  considers  the 

,  .LU* 

minimax  estimation  of  by  ^  relative  to  a  certain  quadratic  loss 
function  with  unknown  weights.  To  the  best  of  our  knowledge,  this 
is  the  first  time  in  the  literature  a  loss  function  of  this  type 
is  considered  in  estimating^.  The  minimax  estimation  of  jj  rela¬ 
tive  to  other  types  of  quadratic  loss  functions  has  been  extensively 
studied  since  Stein  (19S6)  showed  that  the  maximum  likelihood  esti- 
mator,  X,  is  inadmissible,  when  p  i  3,  relative  to  the  loss  func¬ 
tion  given  by  <  -  •  r 


V &i  u* s)  “  (.&  -  l O'*  (&  -  n)' 


(l.i) 


In  particular,  minimax  estimators  for  Ji  relative  to  (1.1)  are  ob¬ 
tained  by  Baranchik  (1970)  when  z  is  known,  by  Strawderman  (1973) 

2 

when  £  »  a  1^,  and  by  Lin  and  Tsai  (1973)  and  Efron  and  Morris  (1976) 
when  £  is  unknown,  among  other  authors.  In  the  1970's,  researchers 
shifted  their  concern  to  the  minimax  estimation  of  v  relative  to 
the  loss  function  given  by 


L2<£;  !d»  r>  ■  ($  -  uTQ(£  -  u)/tr(Q£),  (1.2) 

where  Q  *  diag(q^,  ...,  q^)  under  the  assumption  that  q^,  ...,  q^  are 
known  positive  constants.  Successful  results  are  reported  by  Berger 
(1976a,  b),  Berger  and  Bock  (1976),  Berger  et  al  (1977),  and 


Gleser  (1979),  among  others.  Recently,  Lin  and  Mousa  (1981a,  b), 
in  studying  the  minimax  estimation  under  both  loss  functions,  take 
a  convex  combination  of  (1.1)  and  (1.2)  as  the  loss  function  and 
obtain  classes  of  minimax  estimators  forg  under  the  assumption 
that  I  is  an  unknown  positive  definite  diagonal  matrix. 

In  this  paper  we  will  obtain  a  class  of  mininax  estimators 

for  relative  to  the  loss  function  given  by  (1*2)  where  q^ . q 

are  unknown  positive  weights.  These  positive  weights  should  re¬ 
flect  the  relative  importance  of  the  parameters  to  be  estimated  in 
such  a  way  that  the  more  important  the  component  to  be  estimated 
is,  the  more  weight  should  be  placed  on  that  component  in  the  loss 
function.  Usually,  the  parameters  to  be  estimated  are  not  equally 
important  and,  quite  often,  their  relative  importance  is  not  com¬ 
pletely  known,  resulting  in  an  unknown  assignment  of  weights.  Since 
the  real  weight  matrix  Q  is  unknown,  we  will  assume  that  an  estimator 
Q  of  Q  (or  Q1  of  Q1)  is  available  and  that  Q  is  independent  of  the 
underlying  distribution.  Examples  in  which  Q  is  unknown  are 

(1)  Let  Z, ,  ...»  Z  be  a  random  sample  taken  from  N  (r,  D  where 

*il  p  * 

both  £  and  r  are  unknown.  Consider  the  problem  of  estimating  £  with 

_  n 

respect  to  the  loss  L.(6;  £,  r).  Let  Z  »  (1/n)  l  Z. .  It  is  well 

i  ~  ~  i-r1 

n  _  _ 

known  that  W  ■  £  (Z^  -  Z) (Z^  -  Z)'  has  a  Wishart  distribution  with 

parameters  n  -  1  and  T,  independent  of  the  Z^'s.  Thus,  W/(n  -  1) 
is  an  unbiased  estimator  of  T. 


(2)  Let  X^,  . ..,  be  a  randon  sample  taken  from  ^ (£,  £) • 

Partition  g^,  £,  and  £  as  in  Stein  (I960),  i.e.. 


k 


►  ^ 
n 

a  r 

» 

•  £  ■ 

,  and  £  ■ 

h. 

* 

b  r 

where  Yj,  n  and  A  are  1  >  1,  Z^,  £  and  B  are  p  x  1  vectors,  and  r  is 
an  unknown  p  *  p  positive  definite  matrix.  Then, 


*  fi  +  a’  i  *  1,  •••#  P» 


where 


£ 


r_1B 


and 


n- 


Now,  consider  the  problem  of  estimating  the  regression  vector  g 
with  respect  to  the  loss 

L(B.  £)*(£“  fl)'r(|  -  g)/(A  -  B'r_1B).  (1.3) 

Note  that  r/(A  -  B'r-1B)  is  unknown  and  it  may  be  estimated  by 
V/(T  -  IT  v"  Xip .  Where 


T 

J i 


U' 

V 


•J^St  -  i)(*i  -I)'. 


I  ■  a/”)  I  h 

i-1 


(3)  Consider  the  problem  of  estimating  the  means  ...,  Vp 
(p  2  3)  of  certain  products.  Let  q^,  . ...  qp  be  the  demands  for 
the  products  which  are  unknown  but  which  may  be  estimated,  e.g. , 


by  some  leading  economic  index  or  by  some  other  method.  The  larger 
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the  demand  is  for  a  product  the  more  important  it  will  be  to 

estimate  the  mean  product  accurately  so  that  its  unit  price  may 

be  fairly  adjusted  in  the  market.  The  use  of  loss  function  (1.2) 

with  unknown  weights  in  this  context  is  not  uncommon;  in  fact*  it 

also  can  be  found  in  psychology*  education*  and  social  research 

in  which  an  independent  source  of  information*  that  reflects  the 

relative  importance  of  the  parameters  to  be  estimated*  is  available. 

The  above  examples  suggest  that*  in  practice  *  the  matrix  Q  * 

*-l 

need  not  be  known  rather  that  it  can  be  estimated  by  Q  which 
has  finite  second  moments. 

In  the  following  section,  a  new  class  of  mini max  estimators 

will  be  obtained  for  y,  when  the  covariance  matrix 

X  »  diag(Oj,  ....  Op)  with  unknown  o?*  i  *  1,  ...*  p.  The  special 
2  2 

case  when  a.  »  ...  »  a  will  be  treated  in  Section  3. 


-  6  - 

2.  MINIMAX  ESTIMATION  WHEN  I  »  DIAG(o?»  ....  a2) 

1  P 

Let  X  •  (X, .  ...»  X J '  be  a  p-variate  normal  vector  (p  a  3) 

1  P 

with  unknown  mean  vector  ji  *  (u^»  ...»  u  ) '  and  unknown  diagonal 

2  2  ^  2  2 

covariance  matrix  Z  *  diag (r J',  ....  <y“).  Assume  that  s^/cr^  ^  , 

i  •  1,  . ..,  p,  and  Sj,  ...»  sp  are  mutually  independent,  and  are 
independent  of  X.  It  is  desired  to  estimate  the  mean  vector  p 
under  the  loss  (1.2)  where  Q  is  a  p  x  p  diagonal  matrix  with  un- 

A 

known  diagonal  elements,  >  0,  i  *  1,  . ..,  p.  Let  Q  be  a  p  x  p 
diagonal  random  matrix  with  diagonal  elements  q^,  i  ■  1,  ...,  p, 
independent  of  X  and  s^,  such  that  for  i  *  1,  ...,  p, 

Efaj/Qi)  *  ci#  (2.1a) 

and 

E(VV2  *  di*  (2*lb) 

with  c^  and  d^  being  known  positive  constants.  Let 

C  ■  diag (Cj,  ....  cp)  (2.2a) 

and 

D  ■  diag(d^,  dp).  (2.2b) 

Assume  that 


8  •  <*  W"1) 


(2.5) 
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is  known.  Define  w^  »  Sj/(n^  *  2),  *  *  •••»  P*  and  let 

if  ■  diag(w^,  ...,  Wp).  Then  If  is  independent  of  X  and  Q. 

For  an  estimator  £(£,  If,  $)  let  R(6;  jj,  E)  «  Ew  x  qL2  («;  jj,  E) 
denote  the  risk  function.  It  is  easily  verified  that  the  maximum 
likelihood  estimator,  X,  is  minimax  relative  to  the  loss  function 
given  by  (1.2)  with  constant  risk  equal  to  1.  Thus  an  estimator 
£(&*  *f»  Q)  will  be  minimax  if  and  only  if  R(£;  E)  s  1  for  all 
U  and  E.  In  this  section,  the  estimator  6(X,  N,  Q)  given  by  (2.4) 
will  be  shown  to  be  minimax  by  proving  that  R(X;  jj,  E)  -  R(<5;  ji,  E)  2  0 
for  all  u  and  E.  The  following  lemmas  are  useful  in  evaluating  the 
difference  in  risks;  they  are  stated  here  without  proof. 

Lemma  2.1.  [Stein  (1974,  1981)].  Let  Y  ~  N(0,  1)  and  let  g 
be  an  absolutely  continuous  function,  g:  R  ♦  R  such  that 
g(y)exp(-y2/2)  ♦  0  as  y  ♦  ±».  Then 

EyUrC*)]  »  Ey[Yg (Y)]. 

2 

Lemma  2.2.  [Efron  and  Morris  (1976)].  Let  U  *  ^  and  let 
g  be  an  absolutely  continuous  function,  g:  R*  +  R*  such  that 
g(u)un^2exp(-u/2)  +  0  as  u  ■»  0+  or  as  u  ♦  •.  Then 

EulUgOJ)]  *  nEytgOJ)]  ♦  2Ey [Ug'(U) ] . 

Corollary  2.2.1.  [Berger  and  Bock  (1976)].  Let  U  and  g  be 
as  defined  in  Lemma  2.2.  Let  Z  «  cU/(n  -  2),  c  >  0,  and 
h(Z)  ■  g[(n  -  2)Z/c].  Then 

Ez[(n  -  2)Zh(Z)/c]  -  nEz(h(Z)]  ♦  2Ez(Zh'(Z)]. 


Lemma  2.3.  [Lehmann  (1966)].  Let  S  be  any  random  variable, 
and  let  p^(S)  and  p2(S)  map  the  real  line  into  itself.  If  p^(S) 
and  P2(S)  are  either  both  nonincreasing  in  S  or  both  nondecreasing 
in  S,  then 

Es[Pi(s)P2(s)]  a  Es[Pl(S)]Es[p2(S)]. 

The  above  lemmas  have  been  frequently  used  to  establish  the 

minimaxi ty  of  an  estimator  for  a  multivariate  normal  mean;  they 

are  included  here  for  ease  of  reference. 

The  following  theorem  will  prove  the  minimaxity  of  5(X,  W,  q) 

given  by  (2.4).  In  the  theorem  we  will  use  tr(A)  and  ch  (A) 

(or  <31^^  (A) )  to  denote  the  trace  and  the  maximum  (or  minimum) 

characteristic  root  of  a  square  matrix  A.  As  in  Berger  and  Bock 

(1976),  let  T  *  min  (xl  /n4)  and  t  =  rfo,,  ....  nj  -  E(T-1). 
lSi£p  ni  1  1  P 

Theorem  2.4.  The  estimator 

£(x,  W,  Q)  -  [Ip  -  r(X,  W)  ||x||JJ2Q* V1)^,  (2.4) 

2  E  2  2 

where  ||&|L  *  l  X./w. ,  is  minimax  for  ja,  provided  that 
"  i-1  1  1 

(i)  0  S  r(*.  W)  s  2[tr(C)  -  2t  chnax(c)]/e 

with  tr(C)  a  2t  ch  (C), 
max 

(ii)  r(X,  W)  is  nondecreasing  in  |X^|,  i  ■  1 . . 

(iii)  r(X,  N)  is  nonincreasing  in  w^,  i  ■  1,  ...»  p,  and 

(iv)  r(X,  W)||x||w  is  nondecreasing  in  w^,  i  «  1,  ...,  p. 


P  X4(X.-uJq 


The  minimaxity  of  «(*,  W,  Q)  will  be  established  if  we  can  show  that 
(2.5)  a  0.  To  evaluate  the  expectation  in  (2.5)  we  proceed  as 
follows.  The  first  term  in  the  last  expression  of  (2.5)  becomes 


-  10  - 


The  equality  in  (2.6)  is  obtained  by  an  application  of  Lerana  2.1 

with  y±  •  (XA  -  vi)/ai  and  g(y^  »  rl|x|liJ2oici(oiyi  * 
and  the  inequality  follows  from  Assumption  (ii) .  Now  let 
U.  *  (nA  -  2)wi/<j?  and  MWj)  »  r/{ || Xll^w^) ,  i  ■  1,  ....  p.  Then 
by  an  application  of  Corollary  2.2.1,  it  follows  that 


2o2c 

ii 


-  2)  11*11  J 


(2.7) 


Note  that  identity  (2.7)  is  similar  to  (2.3)  of  Berger  and  Bock 


(1976)  with  their  qj  *  •••  3  <1p  3  i* 

Substituting  (2.6)  and  (2.7)  into  (2.5)  and  noting  that 
(3r/3wi)  £  0  for  i  ■  1,  ...,  p,  by  Assumption  (iii),  the  last 
expression  of  (2.5)  is  bounded  by 


e_x  1^9-- 

M  [l|x||2w 


2r 

uiiij 


P 

l 

i*l 


2y2 

ViVi 

(ni-2)wj 


l 


J|I2|||(  i.l  KjQj 


2r  ch_  (C) 


-  I  llxllw  Mw  lSiipl(ni‘Z,Wi(  i*l  wZ  2||x||;  i=l  w‘q 


e  Bbql  2r8(W)ch^(C) 

v-x  plf  ‘  Il2lg 


2 llxllw  J 


Vx  {t'Hilli2!  *  fcWCWC)  '  t[}.  (2-8) 

(  ni°^ 

e  g(W)  ■  max  7 - sr~  is  nonincreasing  inw.,  i  >  1,  p, 

lsisp  Uni“^wiJ  1 


where  g(W)  ■  max  I- 
lsisp  l1 

and  B  »  ch  (DQ-1). 

msuk 


Now  (2.5)  2  0  if  we  can  show  that  (2.8)  2  0.  But  Lena  2.3 
implies  that  the  last  expression  of  (2.8)  is  further  bounded  below 


E*  {v'l*#;2)  [«« -  “w«wdwc> .  f  £„(.)]} 

Ex{  EwOrl|  X|] ;2J  [tr(C)  -  Zxch^tC)  -  f  £„(,)]} 


which  is  nonnegative  by  Assumption  (i)  of  Theorem  2.4.  This  esta¬ 
blishes  that  (2.5)  *  0  and  the  theorem  is  proved.  □ 

Note  that  the  theorem  will  be  vacuous  unless  2t  £  tr(C)/ch  (C) 

max 

The  values  of  t  can  be  calculated  from  a  formula,  due  to  Berger 
and  Bock  (1976),  if  the  n. 's  are  even.  Recall  that  t  ■  eT max  (n./v2 
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Then, 

U  'Vi'1”1]  cvi)i.‘-,W-1  '■ 

. 

P 

where  m^  ■  n^/2,  m  *  £  m^,  J(k)  =  J  j(i),  and  the  inner  summa- 

i«*l  ixk 

ticm  is  over  all  combinations  (j(l),  j(2),  ...»  j(k-l),  j(k*l),  . .., 

j(p)]  where  the  j (i)  are  integers  between  1  and  m. ,  inclusive. 

On  the  other  hand,  for  the  important  special  case  when  C  3  cl^,  c  >  0, 

tr(C)/[2ch|naTCQ]  ■  p/2.  In  this  case,  2t  £  p,  which  has  been 

shown  by  Berger  and  Bock  (1976)  to  hold  when  p  2  3  and  n-  2  N  for 

some  large  positive  integer  N. 

Remark.  The  assumption  that  B  ■  ch _ (DQ-1)  is  known  seems 

"  max 

undesirable.  However,  since  the  loss  (1.2)  is  invariant  with  res¬ 
pect  to  a  same  positive  scale  change  on  all  elements  of  Q,  we  may 
without  loss  of  generality  assume  that  ch^M)  *  1  in  (1*2).  Thus 
B  s  chMir (D)  and  Assumption  (i)  of  Theorem  2.4  may  be  replaced  by 
(i')  0  £  r(X,  W)  s  2(tr(C)  -  Zxch^ffN/di^D),  with 
tr(C)  2  2TchMjt(C) . 


T  * 


P 

l  l 

k>l 


3.  MINIMAX  ESTIMATION  WHEN  E  ■  o  I  . 

P 

In  this  section  we  will  consider  the  special  case  when 

a  ...  *  o2  ■  o2,  say.  Specifically*  let  X  ■  (X  ,  . ..,  X  )" 

*  P  1  P 

be  a  p-variate  normal  vector  (p  i  3)  with  unknown  mean  vector 

2 

u  and  unknown  covariance  matrix  E  ■  o  I  .  Assume  that 

P 

2  2 

s/c  **  Xn  (n  2  3) ,  independent  of  X.  It  is  desired  to  estimate 

the  mean  vector  Ji  under  the  loss  (1.2)  where  Q  is  a  p  *  p  diagonal 

matrix  with  unknown  diagonal  elements  q.  >  0,  i  «  l,  ...»  p. 

Let  Q  be  as  defined  in  Section  2  satisfying  conditions  (2.1)  and 

* 

(2.2).  Define  w  *  s/(n-2).  Then  w  is  independent  of  X  and  Q. 

In  the  following  theorem,  an  estimator  6  which  is  a  function  of 

A  2 

X,  w,  and  Q  is  shown  to  be  minimax  with  R(£;  ji,  a  )  si  for  all 
H  and  o2. 

Theorem  3.1.  The  estimator 

«(X,  w.  Q)  -  [Ip  -  r(X,  W)||X||;2Q“V1]X, 

2  2 

where  ||x||w  *  X'X/w  ,  is  minimax  for  the  mean  vector  ji,  provided 
that 

(i)  0  S  r(X,  w)  s  2  [tr(C)  -  2  ch^CH/S.  with 

tr(C)  i  2  chmax(C),  n  a  3,  and  B  « 

(ii)  r(X,  w)  is  nondecreasing  in  |X.J,  i  »  1,  ...»  p, 

(iii)  r(X,  w)  is  nonincreasing  in  w,  and 

(iv)  r(X,  w)||x|Jw^  is  nondecreasing  in  w. 
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(X,  w).  Then  the  difference  between  the 
except  for  the  factor  2/tr(Ql),  is 


2  111  II VJ 


^  (X.-u^q. 


I  X?,, 


i«l  qi 


r  r  1 

iFi.i  «i2  . 


r 

I— r—  l  x.(x.-u.)c. 

l]l£Hw-  i.i  1 


*  j?d. 


2n*ny 


nii 

i  J  ’ 


(3.1) 


which  is  similar  to  (2.S).  To  prove  that  £  is  minimax,  it  is 

2 

sufficient  to  show  that  (3.1)  is  2  0,  for  all  ji  and  0  .  This  may 
be  established  by  simply  modifying  the  proof  of  (2.5).  The  details 
are  omitted.  Q 

The  remark  given  at  the  end  of  Section  2  applies  to  this 
case  as  well.  Thus  Assumption  (i)  of  Theorem  3.1  may  be  replaced 


(i')  0  s  r(X,  w)  s  2^j  [tr(C)  -  2  ch^Ql/ch^D) , 

where  tr(C)  *  2  ch  fC)  and  n  *  3. 
max 

The  theorem  is  not  vacuous  since  for  the  special  case  when 

C  •  cl  ,  c  >  0,  the  upper  bound  of  f(X,  w)  reduces  to 
P  ~ 

2[(n-2)/(n+2)]c(p-2)/chMx(D)  which  is  nonnegative  for  all  n  a  3 
and  p  2  3. 
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ABSTRACT  (Continue  on  reverse  side  if  necessary  and  identify  by  block  number 

Let  X  -V  N  (ji,  £),  p  2  3,  where  E  =  diagfa^,  ...»  cQ  and  let  s ^/ai  ^  Xn 
P  P  ni 

(i  *  1,  . p) ,  independent  of  X.  This  paper  obtains  a  class  of  minimax  esti¬ 
mators  6  for  j i  subject  to  the  loss  function  (6  -  j£)'Q(6  -  jj)/tr(QE)  where  Q  is 
a  p  x  p  diagonal  matrix  with  unknown  positive  diagonal  elements.  It  is  assumed 
that  an  independent  estimator  of  Q  *  is  available  which  satisfies  certain 

a_i 

conditions.  The  new  miniraax  estimator  is  a  function  of  X,  s.,  and  Q  ,  and  takes 

a  form  similar  to  the  minimax  estimator  obtained  by  Berger  and  Bock  (Ann.  Statist. 

4  (1976),  642-648)  for  the  case  when  Q  is  known.  A  class  of  minimax  estimators 

2  o 

for  j»  is  also  obtained  for  the  special  case  when  o.  ■  ...  ■  o  . 


